Similarity Search on Spatio-Textual Point Sets

Authors

  • Christodoulos Efstathiades
  • Alexandros Belesiotis
  • Dimitrios Skoutas
  • Dieter Pfoser
Abstract

User-generated content on the Web increasingly has a geospatial dimension, opening new opportunities and challenges in location-based services and location-based social networks for mining and analyzing user behaviors and patterns. The applications of such analysis range from recommendation systems to geo-marketing. Motivated by these needs, querying and analyzing spatio-textual data has received considerable attention in recent years. In this paper, we address the problem of matching point sets based on the spatio-textual objects they contain. This is highly relevant for users associated with geolocated photos and tweets. We formally define this problem as a Spatio-Textual Point-Set Join query, and we introduce its top-k variant. For the efficient treatment of such queries, we extend state-of-the-art algorithms for spatio-textual joins of individual points to the case of point sets. Finally, we extensively evaluate the proposed methods using large-scale, real-world datasets from Flickr and Twitter.
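To make the idea of a point-set join concrete, the following is a minimal brute-force sketch in Python. It is not the paper's algorithm, and the abstract does not give the formal definition, so everything here is an assumption made for illustration: two points are taken to match when their Euclidean distance is at most `eps_loc` and the Jaccard similarity of their keyword sets is at least `eps_doc`, and the similarity of two point sets is taken to be the fraction of their points that have at least one match in the other set. The function names and the threshold names (`eps_loc`, `eps_doc`, `eps_set`) are hypothetical.

```python
from itertools import combinations
from math import hypot


def jaccard(a, b):
    """Jaccard similarity of two keyword sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def points_match(p, q, eps_loc, eps_doc):
    """Assumed matching rule: close in space AND similar in text."""
    (x1, y1, kw1), (x2, y2, kw2) = p, q
    return hypot(x1 - x2, y1 - y2) <= eps_loc and jaccard(kw1, kw2) >= eps_doc


def set_similarity(set_a, set_b, eps_loc, eps_doc):
    """Assumed set similarity: share of points with a match in the other set."""
    total = len(set_a) + len(set_b)
    if total == 0:
        return 0.0
    matched_a = sum(
        any(points_match(p, q, eps_loc, eps_doc) for q in set_b) for p in set_a
    )
    matched_b = sum(
        any(points_match(p, q, eps_loc, eps_doc) for p in set_a) for q in set_b
    )
    return (matched_a + matched_b) / total


def point_set_join(sets, eps_loc, eps_doc, eps_set):
    """Brute-force join: all pairs of point sets with similarity >= eps_set."""
    results = []
    for (id_a, set_a), (id_b, set_b) in combinations(sorted(sets.items()), 2):
        score = set_similarity(set_a, set_b, eps_loc, eps_doc)
        if score >= eps_set:
            results.append((id_a, id_b, score))
    return results


def top_k_point_set_join(sets, eps_loc, eps_doc, k):
    """Top-k variant: the k most similar pairs, with no set-level threshold."""
    scored = point_set_join(sets, eps_loc, eps_doc, 0.0)
    return sorted(scored, key=lambda t: -t[2])[:k]


# Toy data: each user owns a list of (x, y, keywords) points.
users = {
    "u1": [(0.0, 0.0, frozenset({"a", "b"})), (5.0, 5.0, frozenset({"c"}))],
    "u2": [(0.1, 0.0, frozenset({"a", "b"}))],
    "u3": [(100.0, 100.0, frozenset({"z"}))],
}
pairs = point_set_join(users, eps_loc=1.0, eps_doc=0.5, eps_set=0.5)
best = top_k_point_set_join(users, eps_loc=1.0, eps_doc=0.5, k=1)
```

This sketch compares every pair of sets and every pair of points, so its cost grows quadratically. The abstract indicates that the paper instead extends existing spatio-textual join algorithms for individual points, which this example does not attempt to reproduce.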

Similar resources

SEAL: Spatio-Textual Similarity Search

Location-based services (LBS) have become more and more ubiquitous recently. Existing methods focus on finding relevant points-of-interest (POIs) based on users’ locations and query keywords. Nowadays, modern LBS applications generate a new kind of spatio-textual data, regions-of-interest (ROIs), containing region-based spatial information and textual description, e.g., mobile user profiles wit...

Clue-based Spatio-textual Query

Along with the proliferation of online digital maps and location-based services, very large POI (point of interest) databases have been constructed, where each record corresponds to a POI with information including name, category, address, geographical location and other features. A basic spatial query in a POI database is POI retrieval. In many scenarios, a user cannot provide enough information to pin...

Hybrid-LSH for Spatio-Textual Similarity Queries

Locality Sensitive Hashing (LSH) is a popular method for high-dimensional indexing and search over large datasets. However, little effort has been put into utilizing LSH in mobile applications for processing spatio-textual similarity queries, such as finding nearby shopping centers that have a top-ranked hair salon. In this paper, we present hybrid-LSH, a new LSH method for indexing data object...

STEWARD: demo of spatio-textual extraction on the web aiding the retrieval of documents

A spatio-textual search engine, termed "STEWARD", is demonstrated where document similarity is based on both the textual similarity as well as the spatial proximity of the locations in the document to the spatial search input. STEWARD's performance is enhanced by the presence of a document tagger that is able to identify textual references to geographical entities. The user interface of STEWARD p...

Identifying and Ranking the Important Textual and Paratextual Elements in Fiction Retrieval

Purpose: The purpose of this study is to identify the textual and paratextual elements in retrieving fiction from the readers’ perspective, in order to provide the most appropriate access points for readers and to improve access to fiction based on readers’ needs. Method: The current research is an applied study in terms of purpose, applying a mixed method that was conducted using the ...

Publication date: 2016